Dynamic Non-Bayesian Decision Making

Authors

  • Dov Monderer
  • Moshe Tennenholtz
Abstract

The model of a non-Bayesian agent who faces a repeated game with incomplete information against Nature is an appropriate tool for modeling general agent-environment interactions. In such a model the environment state (controlled by Nature) may change arbitrarily, and the feedback/reward function is initially unknown. The agent is not Bayesian; that is, he does not form a prior probability either on the state selection strategy of Nature or on his reward function. A policy for the agent is a function that assigns an action to every history of observations and actions. Two basic feedback structures are considered. In one of them (the perfect monitoring case) the agent is able to observe the previous environment state as part of his feedback, while in the other (the imperfect monitoring case) all that is available to the agent is the reward obtained. Both of these settings refer to partially observable processes, where the current environment state is unknown. Our main result refers to the competitive ratio criterion in the perfect monitoring case. We prove the existence of an efficient stochastic policy that ensures that the competitive ratio is obtained at almost all stages with an arbitrarily high probability, where efficiency is measured in terms of rate of convergence. It is further shown that such an optimal policy does not exist in the imperfect monitoring case. Moreover, it is proved that in the perfect monitoring case there does not exist a deterministic policy that satisfies our long-run optimality criterion. In addition, we discuss the maxmin criterion and prove that a deterministic efficient optimal strategy does exist in the imperfect monitoring case under this criterion. Finally, we show that our approach to long-run optimality can be viewed as qualitative, which distinguishes it from previous work in this area.
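The setting described in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' construction: the names `play`, `maxmin_value`, and `always_first`, the finite reward table, and the encoding of feedback as a `(reward, observed_state)` pair are all hypothetical. The sketch shows a policy as a function of the history, and how the two feedback structures differ in what gets recorded. The `maxmin_value` helper computes a pure-action maxmin benchmark for a reward table that is known to the analyst; the agent in the model does not know this table in advance.

```python
from typing import Callable, List, Optional, Sequence, Tuple

# One feedback item: (reward, observed state). The observed state is None
# under imperfect monitoring (assumed encoding, for illustration only).
Feedback = Tuple[float, Optional[int]]
# A history is the list of (action, feedback) pairs seen so far.
History = List[Tuple[int, Feedback]]
# A policy assigns an action to every history.
Policy = Callable[[History], int]


def play(policy: Policy,
         reward: Sequence[Sequence[float]],
         states: Sequence[int],
         perfect_monitoring: bool) -> History:
    """Run the repeated interaction against a fixed state sequence.

    reward[a][s] is the payoff of action a in state s. The agent never
    reads this table directly; it only sees the feedback in the history.
    """
    history: History = []
    for state in states:
        # The current state is hidden when the action is chosen.
        action = policy(history)
        r = reward[action][state]
        # The state is revealed only afterwards, and only if monitoring is perfect.
        observed = state if perfect_monitoring else None
        history.append((action, (r, observed)))
    return history


def maxmin_value(reward: Sequence[Sequence[float]]) -> float:
    """Best worst-case payoff over pure actions, for a known reward table."""
    return max(min(row) for row in reward)


def always_first(history: History) -> int:
    """A trivial deterministic policy that ignores the history."""
    return 0


# Hypothetical two-action, two-state example.
REWARD = [[1.0, 0.0], [0.5, 0.5]]
STATES = [0, 1, 0]
```

Calling `play(always_first, REWARD, STATES, True)` records each state alongside the reward, whereas passing `False` records only the rewards, which is the whole difference between the two monitoring cases in this sketch.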

Similar resources

Cost Analysis of Acceptance Sampling Models Using Dynamic Programming and Bayesian Inference Considering Inspection Errors

Acceptance sampling models have been widely applied in companies for inspecting and testing raw materials as well as final products. Many lots of items are produced each day in industry, so it may be impossible to inspect or test every item in a lot. The acceptance sampling models only provide the guarantee for the producer and consumer that the items in the lots are acco...

Application of Bayesian decision making tool in detecting oil-water contact in a carbonate reservoir

Detection of Oil-Water Contacts (OWCs) is one of the primary tasks before evaluating a reservoir's hydrocarbon in place and determining net pay zones and suitable depths for perforation operations. This paper introduces a Bayesian decision-making tool as an effective technique for detecting OWCs using wireline logs. To compare strengths of the suggested method in detecting OWC with conventional one, ...

Justifying Bayesianism by Dynamic Decision Principles

As yet, no general agreement has been reached on whether the Bayesian or the frequentist (Neyman-Pearson, NP) approach to statistics is to be preferred. Whereas Bayesians adhere to coherence conditions of de Finetti, Savage, and others, frequentists do not consider these conditions normative and deliberately and knowingly violate them. Hence further arguments, bringing more clarity on the disag...

Intervention and causality in a dynamic Bayesian network

The use of intervention for time series modelling is a well established technique for on-line forecasting and decision-making in the context of Bayesian dynamic linear models. Intervention has also been recently used in (non-dynamic) Bayesian networks to investigate causal relationships between variables, and in dynamic Bayesian networks to investigate lagged causal relationships between time s...

An Iterative Decision Rule to minimize cost of Acceptance Sampling Plan in Machine Replacement Problem

In this paper, we present an optimal iterative decision rule for minimizing total cost in designing a sampling plan for the machine replacement problem, using dynamic programming and Bayesian inference. The cost of replacing the machine and the cost of defectives produced by the machine are considered in the model. The concept of a control threshold policy is applied for decision making. If ...

Increasing effectiveness of model-based fault diagnosis: A Dynamic Bayesian Network design for decision making

This paper aims to design a new approach to increase the performance of decision making in model-based fault diagnosis when the signature vectors of various faults are identical or close. The proposed approach consists in taking into account the knowledge derived from reliability analysis and model-based fault diagnosis. The decision making, formalised as a Bayesian network, i...

Journal:
  • J. Artif. Intell. Res.

Volume 7  Issue 

Pages  -

Publication date: 1997